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Computer defense is not perfect. Each year hackers exploit hundreds of vulnerabilities In 
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This paper describes the general methodology of specifying parallel systems within the 
PAPS (Performance Analysis of Parallel Systems) toolset and presents a case study that 
shows the applicability and accuracy of the Petri net based performance prediction tools 
contained in the toolset. Parallel systems are specified in the PAPS toolset by separately 
defining the program workload, the hardware resources, and the mapping of the program 
to the hardware. The resource parameterization is described ... 
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Most performance debugging and tuning of parallel programs is based on the "measure- 
modify" approach, which is heavily dependent on detailed measurements of programs 
during execution. This approach is extremely time-consuming and does not lend itself to 
predicting performance under varying conditions. Analytic modeling and scalability analysis 
provide predictive power, but are not widely used in practice, due primarily to their 
emphasis on asymptotic behavior and the difficulty of developing acc ... 
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Experience has shown that many widely used benchmarks are poor predictors of the 
performance of systems running commercial applications. Research into this anomaly has 
long been hampered by a lack of address traces from representative multi-user commercial 
workloads. This paper presents research, using traces of industry-standard commercial 
benchmarks, which examines the characteristic differences between technical and 
commercial workloads and illustrates how those differences affect cache ... 
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Current I/O benchmarks suffer from several chronic problems: they quickly become 
obsolete; they do not stress the I/O system; and they do not help much in understanding 
I/O system performance. We propose a new approach to I/O performance analysis. First, 
we propose a self-scaling benchmark that dynamically adjusts aspects of Its workload 
according to the performance characteristic of the system being measured. By doing so, the 
benchmark automatically scales across current and future systems ... 
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A technique is described to automatically compile performance models in the course of 
program translation. The performance models are fully symbolic in order to preserve as 
much diagnostic information as possible. Although compiled statistically, the models 
account for the effects of resource contention, due to the introduction of a novel algorithm 
within the symbolic compilation scheme. It is shown that the compilation approach 
fundamentally outperforms traditional static estimation proced ... 
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In trace-driven simulation, traces generated for one set of system characteristics are used 
to simulate a system with different characteristics. However, the execution path of a 
multiprocessor workload may depend on the order of events occurring on different 
processing elements. The event order, in turn, depends on system charcterlstlcs such as 
memory-system latencies and buffer-sizes. Trace-driven simulations of multiprocessor 
workloads are inaccurate unless the dependencies are eliminated from ... 
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Current I/O benchmarks suffer from several chronic problems: they quickly become 
obsolete, they do not stress the I/O system, and they do not help in understanding I/O 
system performance. We propose a new approach to I/O performance analysis. First, we 
propose a self-scaling benchmark that dynamically adjusts aspects of its workload 
according to the performance characteristic of the system being measured. By doing so, the 
benchmark automatically scales across current and future systems. The eval ... 
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Due to the increasing disparity between processor and main memory system cycle times, 
many computer systems are now incorporating two levels fo cache memory. Several studies 
have been done on the design and performance of second level caches, Including [3] and 
[20], It certainly can and has been shown that the addition of a second level of cache 
enhances the performance of many systems. 
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In order to take on the challenge of fully automatic program parallelizing, one of the last 
and probably the most decisive missing tool is a performance estimation system. In this 
paper a new performance prediction tool Is Introduced, which automatically derives 
performance estimates for single program multiple data (SPMD) parallel Fortran 77 
programs based on distributed memory systems (DMS). The underlying methodology Is 
based on static and dynamic techniques. This paper discusses in part ... 
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Future transaction processing systems may have substantially higher levels of concurrency 
due to reasons which Include: (1) Increasing disparity between processor speeds and data 
access latencies, (2) large numbers of processors, and (3) distributed databases. Another 
Influence is the trend towards longer or more complex transactions. A possible consequence 
is substantially more data contention, which could limit total achievable throughput. In 



H 



http://portal.acm.org/results.cfm?queiy=%2B%22computeiyo20system%22%20%2B%22PE... 6/19/05 



Results (page 4): +"computer system" +TERFORMANCE PREDICTION" +"system per... Page 5 of 5 



particular; it is Icnown tliat tlie usual locking ... 
Keywords: concurrency control, transaction processing 



79 An analysis of dynamic page piacement on a NUMA nriultiprocessor Q 
Richard P. LaRowe, Mark A. Holliday, Caria Schlatter Ellis 

June 1992 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 

1992 ACM SIGMETRICS joint international conference on Measurement and 
modeling of computer systems, volume 20 issue 1 

Full text available: Wi pdfil .08 m\ Additional Information: Ml.c^ltation, abstract, refemnces. citiag.s, index 

The class of NUMA (nonuniform memory access time) shared memory architectures is 
becoming increasingly important with tlie desire for larger scale multiprocessors. In such 
machines, the placement and movement of code and data are crucial to performance. The 
operating system can play a role in managing placement through the policies and 
mechanisms of the virtual memory subsystem. In this paper, we develop an analytic model 
of memory system performance of a Local/Remote NUMA architecture based ... 
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This paper describes different methods and techniques used to model, analyze, evaluate 
and implement distributed file systems. Distributed file systems are characterized by the 
distributed system hardware and software architecture, in which they are implemented as 
well as by the file systems' functions. In addition, distributed file system performance 
depends on the load executed in the system. Modeling and analysis of distributed file 
systems requires new methods to approximate complexity of th ... 
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